The Best - Performance Digital Video Recorder JPEG 2000 DVR
نویسنده
چکیده
JPEG 2000, the new ISO/ITU-T standard for still image coding, is about to be finished. Other new standards have been recently introduced, namely JPEG-LS and MPEG-4 VTC(H.264). This paper compares the set of features offered by JPEG 2000, and how well they are fulfilled, versus JPEG and MPEG-4 VTC(H.264). The study concentrates on compression efficiency and functionality set, while addressing other aspects such as complexity. Lossless compression efficiency as well as the fixed and progressive lossy rate-distortion behaviors are evaluated. Robustness to transmission errors, Region of Interest coding and complexity are also discussed. The principles behind each algorithm are briefly described. The results show that the choice of the “best” standard depends strongly on the application at hand, but that JPEG 2000 supports the widest set of features among the evaluated standards, while providing superior rate-distortion performance in most cases. INTRODUCTION It has been three years since the call for proposals 1 for the next ISO/ITU-T standard for compression of still images, JPEG 2000, has been issued. Now JPEG 2000 Part I (the core system) is in its final stage to become an International Standard (IS). It has been promoted to Final Committee Draft (FCD) 2 in March 2000 and will reach IS status by the end of the same year. A great effort has been made to deliver a new standard for today's and tomorrow's applications, by providing features inexistent in previous standards, but also by providing higher efficiency for features that exist in others. Now that the new standard is nearing finalization, a trivial question would be: what are the features offered by JPEG 2000 but also how well are they fulfilled when compared to other standards offering the same features. This paper aims at providing an answer to this simple but somewhat complex question. Section 2 provides a brief overview of the techniques compared, with special attention on new features of JPEG 2000 such as Region of Interest (ROI) coding. Section 3 explains the comparison methodology employed in the results shown in section 4 and conclusions are drawn in section 5. Appendix JPEG2000 DVR V.S M-PEG & MPEG4 (H.264) OVERVIEW OF STILL IMAGE CODING STANDARDS For the purpose of this study we compare the coding algorithm in JPEG 2000 standard to the following three standards: JPEG 3 , MPEG-4 Visual Texture Coding (VTC) 4 The reasons behind this choice are as follows. JPEG is one of the most popular coding techniques in imaging applications ranging from Internet to digital photography. MPEG-4 VTC is very recent standards that start appearing in various applications. It is only logical to compare the set of features offered by JPEG 2000 standard not only to those offered in a popular but older standard (JPEG), but also to those offered in most recent ones using newer state-of-the-art technologies. JPEG This is the very well known ISO/ITU-T standard created in the late 1980s. There are several modes defined for JPEG 3 , including baseline, lossless, progressive and hierarchical. The baseline mode is the most popular one and supports lossy coding only. The lossless mode is not popular but provides for lossless coding, although it does not support lossy. In the baseline mode, the image is divided in 8x8 blocks and each of these is transformed with the DCT. The transformed blocks are quantized with a uniform scalar quantizer, zig-zag scanned and entropy coded with Huffman coding. The quantization step size for each of the 64 DCT coefficients is specified in a quantization table, which remains the same for all blocks. The DC coefficients of all blocks are coded separately, using a predictive scheme. Hereafter we refer to this mode simply as JPEG. The lossless mode is based on a completely different algorithm, which uses a predictive scheme. The prediction is based on the nearest three causal neighbors and seven different predictors are defined (the same one is used for all samples). The prediction error is entropy coded with Huffman coding. Hereafter we refer to this mode as L-JPEG. The progressive and hierarchical modes of JPEG are both lossy and differ only in the way the DCT coefficients are coded or computed, respectively, when compared to the baseline mode. They allow a reconstruction of a lower quality or lower resolution version of the image, respectively, by partial decoding of the compressed bitstream. Progressive mode encodes the quantized coefficients by a mixture of spectral selection and successive approximation, while hierarchical mode uses a pyramidal approach to computing the DCT coefficients in a multi-resolution way. MPEG-4 VTC (H.264) MPEG-4 Visual Texture Coding (VTC) is the algorithm used in MPEG-4 4 to compress visual textures and still images, which are then used in photo realistic 3D models, animated meshes, etc., or as simple still images. It is based on the discrete wavelet transform (DWT), scalar quantization, zero-tree coding and arithmetic coding. The DWT is dyadic and uses a Daubechies (9,3) tap biorthogonal filter. The quantization is scalar and can be of three types: single (SQ), multiple (MQ) and bi-level (BQ). With SQ each wavelet coefficient is quantized once, the produced bitstream not being SNR scalable. With MQ a coarse quantizer is used and this information coded. A finer quantizer is then applied to the resulting quantization error and the new information coded. This process can be repeated several times, resulting in limited SNR scalability. BQ is essentially like SQ, but the information is sent by bitplanes, providing general SNR scalability. Two scanning modes are available: tree-depth (TD), the standard zero-tree scanning, and band-by-band (BB). Only the latter provides for resolution scalability. The produced bitstream is resolution scalable at first, if BB scanning is used, and then SNR scalable within each resolution level, if MQ or BQ is used. A unique feature of MPEG-4 VTC is the capability to code arbitrarily shaped objects. This is accomplished by the means of a Appendix JPEG2000 DVR V.S M-PEG & MPEG4 (H.264) shape adaptive DWT and MPEG-4’s shape coding. Several objects can be encoded separately, possibly at different qualities, and then composited at the decoder to obtain the final decoded image. On the other hand, MPEG-4 VTC does not support lossless coding. JPEG 2000 JPEG 2000, as noted previously, is the next ISO/ITU-T standard for still image coding. In the following, we restrict the description to Part I of the standard, which defines the core system. Part II will provide various extensions for specific applications, but is still in preparation. JPEG 2000 is based on the discrete wavelet transform (DWT), scalar quantization, context modeling, arithmetic coding and post-compression rate allocation. The DWT is dyadic and can be performed with either the reversible Le Gall (5,3) taps filter, which provides for lossless coding, or the non-reversible Daubechies (9,7) taps biorthogonal one, which provides for higher compression but does not do lossless. The quantizer follows an embedded dead-zone scalar approach and is independent for each sub-band. Each sub-band is divided into rectangular blocks (called code-blocks in JPEG 2000), typically 64x64, and entropy coded using context modeling and bit-plane arithmetic coding. The coded data is organized in so called layers, which are quality levels, using the post-compression rate allocation and output to the code-stream in packets. The generated code-stream is parseable and can be resolution, layer (i.e. SNR), position or component progressive, or any combination thereof. JPEG 2000 also supports a number of functionalities, many of which are inherent from the algorithm itself. Examples of this is random access, which is possible because of the independent coding of the code-blocks and the packetized structure of the codestream. Another such functionality is the possibility to encode images with arbitrarily shaped Regions of Interest (ROI). The fact that the subbands are encoded bitplane by bitplane makes it possible to select regions of the image that will precede the rest of the image in the codestream. By scaling the sub-band samples so that the bitplanes encoded first only contain ROI information and following bitplanes only contain background information. The only thing the decoder needs to receive is the factor by which the samples were scaled. The decoder can then invert the scaling based only on the amplitude of the samples. Other supported functionalities are error-resilience, random access, multicomponent images, palletized color, compressed domain lossless flipping and simple rotation, to mention a few. Table . Functionality matrix. A “+” indicates that it is supported, the more “+” the more efficiently or better it is supported. A “-” indicates that it is not supported. JPEG 2000 JPEG MPEG-4 VTC(H.264) lossless compression performance +++ + lossy compression performance +++++ +++ ++++ progressive bitstreams +++++ ++ +++ Region of Interest (ROI) coding +++ + arbitrary shaped objects ++ random access ++ low complexity ++ +++++ + error resilience +++ ++ +++ non-iterative rate control +++ + genericity +++ ++ ++ Appendix JPEG2000 DVR V.S M-PEG & MPEG4 (H.264) MPEG-4 VTC, as JPEG 2000, is able to produce progressive bitstreams without any noticeable overhead. However, the latter provides more progressive options and produces bitstreams that are parseable and that can be rather easily reorganized by a transcoder on the fly. Along the same lines, JPEG 2000 also provides random access (i.e. involving a minimal decoding) to the block level in each sub-band, thus making possible to decode a region of the image without having to decode it as a whole. These two features could be very advantageous in applications such as digital libraries. Another result of the fact that JPEG2000 generates progressive bitstream is the Region of Interest functionality. As shown in the previous section it is possible to generate a bitstream in which specific areas of the image are refined earlier than the rest. The bitstream can be organized so that either the image is stored with the same quality for the ROI and the rest of the image, or if the bitstream is truncated, the ROI will have better quality than the background. Concerning error resilience JPEG 2000 offers higher protection than JPEG, as shown in the previous section. MPEG-4 VTC also offers error resilience features and although it could not be evaluated, the support is expected to be in between JPEG and JPEG 2000. Overall, one can say that JPEG 2000 offers the richest set of features and provides superior rate-distortion performance. However, this comes at the price of additional complexity when compared to JPEG, which might be currently perceived as a disadvantage for some applications, as was the case for JPEG when it was first introduced.
منابع مشابه
Design, Development and Deployment of Web-Based
This report presents the development process of a web-based digital video recorder system (DVR). The conventional DVR system is mainly designed and used for local program recording. We developed a new DVR system that can be accessible from any where in the world through the web interface. By using the new system, a user can record and watch TV programs across countries.
متن کاملApply Semi-fragile Watermarking to Authentication of Compressed Video Data
This paper proposes an effective technique, which can detect malicious manipulations under video lossy compressing data (e.g. H.263) and still-image lossy compressing data (e.g. JPEG). A blockclassification strategy is used to divide DCT-blocks into the flat-blocks and the normal-blocks. Simple features of the both blocks are embedded invisibly. For later authentication, the watermarked frame i...
متن کاملComparative Analysis of H.264 and Motion-jpeg2000 Compression for Video Telemetry
The H.264/AVC standard, popular in commercial video recording and distribution, has also been widely adopted for high-definition video compression in Intelligence, Surveillance and Reconnaissance and for Flight Test applications. H.264/AVC is the most modern and bandwidth-efficient compression algorithm specified for video recording in the Digital Recording IRIG Standard 106-11, Chapter 10. Thi...
متن کاملThe Competitive Impact of Targeted Television Advertisements Using DVR Technology
Digital video recorders (DVRs) are an emerging technology that is fundamentally changing the competitive landscape in industries that advertise on television. Perhaps the most familiar impact of DVR technology is that it enables consumers to avoid advertisements by fast forwarding through them. However, this zipping of ads is only one aspect of the impact of DVR technology. DVRs also collect ...
متن کاملCoping with TiVo: Opportunities of the networked digital video recorder
Despite the progressive switch to digital TV, there has not been any significant change to the value chain of the TV industry. At the same time, the introduction of novel information and communication technologies, such as the digital video recorder (DVR) and efficient Peer-to-Peer (P2P) content distribution, have been regarded as a threat to the established broadcast business players. Previous...
متن کاملMitigation Technique For Voltage Sag & Swell By Using Dynamic Voltage Restorer
This paper presents the application of dynamic voltage restorers (DVR) on power distribution Systems for mitigation of voltage sags/swells at critical loads. DVR is one of the compensating types of custom power devices. An adequate modeling and simulation of DVR, including controls in MATLAB, show the flexibility and easiness of the MATLAB environment in studying and Understanding such compensa...
متن کامل